A Progressive Strategy for DNA Resequencing Problem

نویسندگان

  • Chia-Wei Lu
  • Chuan Yi Tang
  • R. C. T. Lee
  • R. O. C
چکیده

In this paper, we are going to solve the DNA resequencing problem. We are given a set of subseuqences obtained by the next generation sequencing (NGS) technology and we are asked to assembly them into a DNA sequence with the aid of a reference sequence. In this paper, we propose a progressive strategy to solve the resequencing problem. Our algorithm allows us to map data to reference sequence efficiently. Compared with other available tools, our approach allows us to be able to map much more subsequences to the reference sequence. 1 A Brief Introduction of the String Matching Problems As we shall see later, the DNA resequencing problem is a variation of the traditional string matching problems. Therefore, a brief introduction of the string matching problems and notations often used will be first given below. In string matching problems, there are a text n and a pattern m where both ’s and i ’s are characters. We assume that . We denote j i i by and j i i by . We shall consider two different kinds of string matching problems: Exact string matching problem and approximate string matching problems. t t t T L 2 1 = p p p P L 2 1 = i t p n m ≤ t t t L 1 + ) , ( j i T p p p L 1 + ) , ( j i P For the exact string matching problem, we are given a text n and a pattern m . Our job is to find whether t t t T L 2 1 =

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Resequencing and Feature Assignment on an Automated Assembly Line

We consider the problem of resequencing a prearranged set of jobs on a moving assembly line with the objective of minimizing changeover costs. A changeover cost is incurred whenever two consecutive jobs do not share the same feature. Features are assigned from a set of job-specific feasible features. Resequencing is limited by the availability of offline buffers. The problem is motivated by a v...

متن کامل

Resequencing and Feature Assignment on a Moving Assembly Line

We consider the problem of resequencing a pre-arranged set of jobs on a moving assembly line with the objective of minimizing changeover costs. A changeover cost is incurred whenever two consecutive jobs do not share the same feature. Features are assigned from a set of job-specific feasible features. Re-sequencing is limited by the availability of offline buffers. The problem is motivated by a...

متن کامل

Reconstructed Ancestral Sequences Improve Pathogen Identification Using Resequencing DNA Microarrays

We describe the benefit of using reconstructed ancestral sequences (RAS) on resequencing microarrays for rapid pathogen identification, with Enterobacteriaceae rpoB sequences as a model. Our results demonstrate a sharp improvement of call rate and accuracy when using RASs as compared to extant sequences. This improvement was attributed to the lower sequence divergence of RASs, which also expand...

متن کامل

A two-stage stochastic rule-based model to determine pre-assembly buffer content

This study considers instant decision-making needs of the automobile manufactures for resequencing vehicles before final assembly (FA). We propose a rule-based two-stage stochastic model to determine the number of spare vehicles that should be kept in the pre-assembly buffer to restore the altered sequence due to paint defects and upstream department constraints. First stage of the model decide...

متن کامل

Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores

BACKGROUND Although the throughput of next generation sequencing is increasing and at the same time the cost is substantially reduced, for the majority of laboratories whole genome sequencing of large cohorts of cancer samples is still not feasible. In addition, the low number of genomes that are being sequenced is often problematic for the downstream interpretation of the significance of the v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010